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Attorney Docket No. 8336A-000020 

HYPERTEXT MARKUP LANGUAGE 
CACHE SYSTEM AND METHOD 

Technical Field 

5 This invention relates to cache systems. More particularly, this 

invention relates to the use of cache systems in a set top box environment for 
accessing hypertext markup language (HTML) documents. 

BACKGROUND OF THE INVENTION 

10 Set top boxes typically provide an interface between a television 

display or peripheral and a multimedia source that includes audio and video 
content. In some instances, the set top box is integrated with the television. 
The set top boxes are connected by cable systems (coaxial, fiberoptic, etc.), 
satellite-based systems, and other terrestrial-based systems to one or more 

15 media providers. The multimedia sources predominantly provide video that 
contains moving images. In the future, set top boxes will also be required to 
interface with distributed communications systems such as the Internet. 

When a user accesses the Internet, the user typically employs a web 
browser that issues hypertext transfer protocol (HTTP) requests using 

20 Transmission Control Protocol/Internet Protocol (TCP/IP) to a host server. 
The host server runs software that responds to the HTTP requests. When the 
user requests a web page, the host server retrieves the web page and sends 
it to the user over the Internet. Web pages or HTML documents are made up 
of HTML text plus images, audio, video, scripts, fonts, and other media types. 
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The current web page access times that are required to retrieve a 
requested web page are commercially unacceptable. Web users do not like 
to wait very long to load web pages into their browsers. As loading speed 
increases, the users tend to lose interest. Loading speed continues to be the 
5 number one problem of web browsers. 

SUMMARY OF THE INVENTION 

A cache management system for a set top box improves the loading 
speed of hypertext markup language (HTML) documents that are provided by 

10 web servers through a distributed communications system. The cache 
management system includes a set top box with a processor and memory that 
includes cache. HTML documents are stored in the cache. A cache manager 
manages the cache and calculates a removal factor for each of the HTML 
documents. The cache manager removes the HTML documents based on its 

15 removal factor until sufficient room is available for an additional HTML 
document when a user requests access to the additional HTML document and 
insufficient cache is available to store the additional HTML document. The 
cache manager keeps as many relevant documents as possible in the cache. 
In other features of the invention, the removal factor is at least partially 

20 based on usage of the HTML documents such as a number of times that the 
HTML documents was used and how long since the HTML documents were 
last used. 

In still other features of the invention, the removal factor is at least 
partially based on a size of the HTML documents, a file type of the HTML 
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documents, whether the HTML documents are compressed or decompressed, 
and/or a developer-designated priority for the HTML documents. 

In yet other features of the invention, the removal factor is calculated 
by multiplying a first factor by a first weight to generate a first product, a 

5 second factor by a second weight to generate a second product, and an n th 
factor by an n th weight to generate an n* product. The first, second and nth 
products are summed. The first, second and n th products can be normalized 
prior or after being summed. 

In other features of the invention, a table is accessible by the cache 

10 manager and is stored in memory of the set top box. The HTML documents 
are arranged in an ordered list based on the removal factors. 

In still other features of the invention, the cache manager employs 
hysteresis when removing the HTML documents from the cache to create 
room for the additional HTML document when the user requests access to the 

15 additional HTML document and insufficient cache is available to store the 
additional HTML document. The cache manager provides hysteresis by 
continuing to remove the HTML documents until a predetermined breathing 
room threshold is available. 

Still other objects, features, and advantages will be readily apparent 

20 from the specification, the drawings, and the claims. 



DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a functional block diagram of a set top box system 
according to the invention; 

FIG. 2 is a function block diagram of a web server; 
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FIG. 3 is a functional block diagram of a set top box; 

FIG. 4A is a flowchart illustrating steps for managing the cache 
of the set top box when web pages are requested by a user; 

FIG. 4B illustrates the flowchart of FIG. 4A with additional steps 
5 for calculating removal factors and for updating an ordered list; 

FIG. 5A is a flowchart illustrating steps for removing cache items 
from the cache when a new web page has been requested; 

FIG. 5B illustrates the flowchart of FIG. 5A with additional steps 
for removing "useOnce" web pages and expired web pages and for 
10 maintaining cache and system comfort levels; 

FIG. 6 is a flowchart illustrating steps for periodically maintaining 

the cache; 

FIG. 7A illustrates a table for managing cache items; 
FIG. 7B illustrates a table for storing factor weights according to 
15 cache item type; 

FIG. 8 is a data flow view illustrating the calculation of removal 

factors; and 

FIG. 9 is a data flow view illustrating a preferred way of 
calculating the removal factors. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The ensuing detailed description provides preferred exemplary 
embodiments only and is not intended to limit the scope, applicability or 
configuration of the present invention. Rather, the ensuing detailed 
5 description of the preferred exemplary embodiments will provide those skilled 
in the art with an enabling description for implementing preferred exemplary 
embodiments of the invention. It being understood that various changes may 
be made in the function and arrangement of the elements without departing 
from the spirit and scope of the invention as set forth in the appended claims. 

10 Referring to FIGs. 1 and 2, a set top box system 10 is illustrated. The 

set top box 10 includes one or more web servers 14-1, 14-2, ...,14-n that are 
connected to the Internet 18. A gateway 20 connects a distributed 
communications system (DCS) 22 to the Internet 18. The gateway 20 can be 
a proxy server, a router, a fireway or any other suitable gateway. The web 

15 server 14 can be connected to the DCS 22. A media provider 24 is connected 
to the DCS 22. Alternately, the media provider 24 can be connected to the 
Internet 18. One or more set top boxes 26-1, 26-2, 26-n are connected to 
the DCS 22. The media provider 24 provides multimedia content via the DCS 
22 to the set top boxes 26. A television (TV) display 30 is connected to each 

20 set top box 26. 

Skilled artisans can appreciate that connections between the Internet 
18, the web servers 14 and the gateway 20 and between the set top box 26, 
the media provider 24 and the DCS 22 can be any suitable transmission 
media such as a cable connection (coaxial cable or fiberoptic cable), a cable- 

25 based modem, a phone-based modem, a local area network (LAN), a wide 
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area network (WAN), a satellite-based connection, a terrestrial-based 
connection, or any other suitable connection. IEEE 1394 wire may also be 
employed. Additionally, the set top box 26 can be integrated with electronics 
that are associated with the display 30 if desired. The display 30 can be a TV, 
5 a network TV display, or other suitable displays. 

In use, the web servers 14 deliver web pages that are requested by the 
user of the set top box 26. The web server 14 includes an input/output (I/O) 
interface 72 that is connected to a processor in memory 78. The web server 
14 is connected to the Internet 18 or the DCS 22. The memory 78 preferably 

10 includes read only memory (ROM), Random Access Memory (RAM) and/or 
external storage such as floppy drives, hard drives, optical drives, and other 
suitable storage media. The memory 78 includes an operating system/server 
module 84. The web server 14 is connected to the Internet 18 in any of the 
ways that were previously described. The web server 14 typically includes a 

15 display 102, a keyboard 104, a mouse 106, and other suitable I/O devices 
108. 

Referring now to FIG. 3, the set top box 22 is illustrated in further detail 
and includes an I/O interface 150, a processor 154, and memory 156. The 
set top box 26 is connected to the DCS 22. The memory 156 includes 

20 Random Access Memory (RAM) and Read Only Memory (ROM), and/or other 
electronic storage such as a hard drive, floppy drive, an optical storage 
device, or any other suitable electronic memory storage. When the set top 
box 26 is turned on, an operating system 160 is loaded into memory 156. A 
set top box browser 162 is also loaded into memory 156 when needed. A 

25 portion of the memory 156 is used for cache 164. Cache items include web 
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pages or HTML documents that include HTML text plus images, audio, video, 
scripts, fonts such as MicroType® and TrueType® and scripts such as 
JavaScript® that are stored in the cache 164. Still other media types that will 
be developed in the future are contemplated. A cache manager 166 manages 
5 the cache 164. The cache manager 166 is associated with one or more 
tables 168 that contain information for managing the cache 164 as will be 
described further below. 

The set top box 26 is connected to the DCS 22 in any of the ways 
described above. The display 30 is connected to the I/O interface 150. The 
10 set top box 26 can include I/O devices 176 such as a keyboard 180, a remote 
184, a joystick (not shown), a game pad (not shown), or any other suitable I/O 
devices. 

Referring now to FIG. 4A, a flowchart illustrating steps for managing 
the cache 164 using the cache manager 166 is illustrated and is generally 

15 designated 200. Control begins at step 202. Control continues with step 204 
where control determines whether the user has requested a web page using a 
uniform resource locator (URL). If not, control loops back to step 204. 

Otherwise control continues with step 206 where control determines 
whether the requested web page is dynamically generated. Examples of 

20 dynamically generated web pages include JavaScript® web pages having a 
".jsp" extension and active server pages having a ".asp" extension. Skilled 
artisans will appreciate that the present invention has application to other 
dynamically generated web page formats that currently exist and/or that will 
be created in the future. If the web page is not dynamically generated, control 

25 continues with step 208 where control determines whether the web page 
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requested by the user exists in the cache 164. If the web page exists in the 
cache 164, control continues with step 212. Control outputs the web page to 
the display 30 and continues with step 204. 

If the web page is dynamically generated, control continues from step 
206 to step 216. If the web page is not dynamically generated and the web 
page does not exist in cache, control continues from step 208 to step 216. In 
step 216, control retrieves the requested web page from the web server 14 
using the URL. 

Control continues with step 220 where control determines if sufficient 
cache is available to store the web page. If insufficient cache is available, 
control clears sufficient cache to store the web page in step 224. In step 226, 
control determines whether the web page is dynamically generated. If not, 
control stores the web page in the cache in step 228. Control continues with 
step 230 where control outputs the web page to the display 30. If the web 
page is dynamically generated, control continues from step 226 to step 230. 
Control continues from step 230 to step 204. 

Referring now to FIG. 4B, a second flowchart illustrating steps for 
managing the cache 164 using the cache manager 166 is generally 
designated 200'. Reference numbers from FIG. 4A have been used with a 
prime ( £ ) to denote similar elements where appropriate. Control continues 
after step 212' and calculates removal factors at step 232. In step 234, control 
updates an ordered list based on the updated removal factors. 

Removal factors are used to rank the cache items that are stored in the 
cache 164. Cache items with a higher removal factor take precedence over 
cache items with lower removal factors. The cache items are organized in the 
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ordered list to increase the speed of the cache management process. 
Removal factors will be described further below in connection with FIGs. 7A, 
7B, 8, and 9. 

Referring now to FIG. 5A, a first flowchart illustrates steps performed 
5 by the cache manager 1 66 to clear sufficient cache (steps 224 and 224' in 
FIGs. 4A and 4B). Control begins with step 250. At step 252, control 
determines whether the web page size is greater than the unused cache. If 
not, control stores the web page in cache and outputs the web page to the 
display 30 in step 256'. In step 260, control returns to step 202. 

10 If the web page size exceeds the unused cache, control continues with 

step 264 where control identifies a web page with the lowest removal factor. 
At step 268, control deletes the web page with the lowest removal factor and 
returns to step 252. Control also preferably removes web pages that were 
preloaded when the deleted web page was initially loaded. Control loops 

15 through steps 252, 264 and 268 until the web page size is less than the 
unused cache. When the web page size is less than the unused cache, 
control continues with steps 256 and 260. 

Referring now to FIG. 5B, a second flowchart illustrates steps 
performed by the cache manager 166 to clear sufficient cache (steps 224 and 

20 224' in FIGs. 4A and 4B) to provide hystersis or breathing room. Control 
begins at step 280. Control continues with step 282 where control determines 
whether the web page size exceeds available system memory minus a 
breathing room threshold. If is does, control continues with step 318 where 
control removes web pages with the tag "useOnce" and removes expired web 
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pages. The "useOnce" and expiration tags are set by the content developers 
of the web site and/or by the set top box 26. 

Control continues with step 322 where control determines whether the 
web page exceeds the available system memory minus the breathing room 
5 threshold. The extra amount of memory that is cleared for breathing room 
reduces the number of times that the process of deleting the web pages in 
cache will be performed. If not, control continues with step 326 where control 
identifies the web page with the lowest removal factor. Control continues with 
step 330 where control deletes the web page with the lowest removal factor. 
J 10 Control also preferably identifies and deletes web pages that were preloaded 

r& when the deleted web page was initially loaded (if the related web page(s) are 

ry not used). Control loops through steps 322, 326 and 330 until the web page 

% * exceeds the available system memory minus the breathing room threshold. 

If the web page size does not exceed the available system memory 
m 15 minus the breathing room threshold (as determined in steps 282 or 322), 

(3 control continues with step 332. In step 332, control stores the web page in 

cache and displays the web page on the display 30. In step 336, control 
continues with step 202. 

Referring now to FIG. 6, an alternate method of managing the cache is 
20 illustrated at 350. Housekeeping of the cache 164 is performed on a periodic 
basis. Control begins with step 352. Control continues with step 356 where 
control determines whether a timer has expired. If not, control loops back to 
step 356. Otherwise, control continues with step 360 where control removes 
web pages with a tag "useOnce". Control continues with step 364 where 
25 control removes web pages that have expired. 
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Control continues with step 370 where control determines whether the 
available system memory is greater than the breathing room threshold. If not, 
control continues with step 372 where control identifies a web page with the 
lowest removal factor. At step 376, control deletes the web page with the 
5 lowest removal factor. Control also preferably identifies and deletes web 
pages that were preloaded when the deleted web page was initially loaded (if 
the related web page(s) were not used). Control loops through steps 370, 
372 and 376 until the available system memory exceeds the breathing room 
threshold. Control may also check whether available system memory 
10 increases after looping through steps 370, 372 and 376 to prevent endless 
looping when system memory is taken by other set top functions and no web 
pages are stored in cache. Similar checks may be performed in FIGs. 4A, 4B, 
5A and 5B. 

When the available system memory exceeds the breathing room 
15 threshold, control continues with step 380. In step 380, control updates the 
removal factors. In step 382, control updates the ordered list of removal 
factors and continues with step 356. 

Referring now to FIG. 7A, a table 400 containing removal factors is 
illustrated. The table 400 includes a plurality of columns 402 and a plurality of 
20 rows 404. The table 400 includes a key column 406 that contains a key that 
identifies a web page. Additional columns include a "LastUsed" column that 
identifies the last time that the web page was used. A "TimesUsed" column 
identifies how many times the cache item has been used since being stored in 
the cache 164. A "Size" column 412 identifies the size of the cache item. A 
25 "Type" column indicates the media type of the cache item such as video, text, 



11 



audio, etc. A "C/D" column 418 indicates whether the cache item is 
compressed ("C") or decompressed ("D"). A "Priority" column 420 indicates a 
developer-assigned priority for the cache item. The developer-assigned 
priority can be set at "0" that is the default (or no priority specified) or "1" to 
5 "9". "1" is the lowest priority and "9" is the highest priority. A "Removal 
Factor" column 424 is a calculated factor that will be described further below. 
Preferably, the table is sorted by the "Removal Factor" column in a lowest-to- 
highest or in a highest-to-lowest order depending upon how the table is 
accessed. 

10 Referring now to FIG. 7B, a second table 430 is illustrated. The table 

430 contains rows 432 and columns 434 that define the weightings that will be 
given to the columns of the table 400 in calculating the removal factor for a 
particular cache item. The columns include a Type" key column that 
corresponds to the "Type" column 414 in FIG. 7A. The table 430 includes 

15 factor weightings 440, 442, 444, 446, 448 and 450 that are associated with 
the columns of the table 400 in FIG. 7A. For example, the factor weighting 
440 is associated with the "LastUsed" column 408. The factor weighting 442 
is associated with the "Timesllsed" column 410. The factor weighting 444 is 
associated with the "Size" column 412. The factor weighting 446 is 

20 associated with the "Type" column 414. The factor weighting 448 is 
associated with the "C/D" column 418. The factor weighting 450 is associated 
with the "Priority" column 420. The particular row from the table 430 is 
selected based on the data in the "Type" column 414 for the particular web 
page. 
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Referring now to FIG. 7C, table 454 contains values that are 
substituted for the "Type" column values when calculating the removal factors. 
Referring now to FIG. 7D, table 458 contains values that are substituted for 
the "C/D" column values when calculating the removal factors. Referring now 
to FIG. 7E, table 462 contains values that are substituted for the TimesUsed" 
column values when calculating the removal factors. The "LastUsed" column 
is preferably is a difference between either a fixed date or the oldest date in 
the table. The difference is preferably normalized to create a factor. 

Referring now to FIG. 8, the calculation of the removal factors is 
illustrated. An arithmetic function generator 472 generates an output based 
on one or more inputs ( F(input1, input2, input n) ). A first arithmetic factor 
474, a second arithmetic factor 476 and an n th arithmetic factor 478 are input 
to the arithmetic function generator 472. One or more weights may be stored 
in a table and input to the arithmetic function generator 472. 

A logical function generator 482 includes one or more inputs. A first 
logical factor 484, a second logical factor 486 and an n th logical factor 490 are 
input to the logical function generator 482. The logical function generator 482 
generates values from non-numerical data such as "Type", "C/D", etc. The 
logical function generator 482 provides value outputs using lookup tables 
and/or logical outputs such as enable and disable outputs. For example, if the 
set top box includes hardwired or hardware accelerated circuits for 
decompressing MPEG images, the decompressed images can be deleted 
without a significant loss of performance. If the HTML document is a 
decompressed MPEG image, the logical function generator can output a 
disable signal to set the removal factor equal to zero - effectively eliminating 
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the decompressed MPEG image from the cache. The removal factor 
generator 472 outputs a removal factor based on arithmetic and/or logical 
functions. The removal factor is stored in the table 402. The removal factor 
calculations are repeated for each cache item. 

Referring now to FIG 9, a data flow view illustrating the calculations 
that are performed by the cache manager 166 to calculate the removal factors 
according to a preferred embodiment of the invention is shown at 500. The 
cache manager 166 selects a first cache item (such as "www.add...") in the 
table 400. The cache manager 166 selects a first factor (such as the priority 
of the cache item - in this example the priority is "8") from the table 400 and 
stores the first factor at 502. The cache manager 166 selects a first factor 
weight from the table 430 by using the data value of the "Type" column ("A") 
for the first cache item as a lookup (in this example the first factor weight 
would be "10" that is found in column 450). The first factor weight is 
temporarily stored at 504. 

The cache manager 166 selects a second factor (such as the "Type" of 
the cache item - in this example the type is "A") from the table 400. The 
cache manager 166 selects the value ("20") from the table 454 and 
temporarily stores the second factor at 504. The cache manager 166 selects 
a second factor weight from the table 430 by using the data value of the 
"Type" column ("A") for the first cache item as a lookup. In this example, the 
second factor weight would be "20" that is found in column 446. The cache 
manager temporarily stores the second factor weight at 512. The cache 
manager 166 selects 0 to n additional factors and factor weights in a similar 
manner as indicated at 516 and 520. 
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The cache manager 166 multiplies the first factor by the first factor 
weight at 522 to generate a first product 524. The cache manager 166 
multiplies the second factor by the second factor weight at 526 to generate a 
second product 528. The cache manager 166 multiplies the n th factor by the 
5 n th factor weight at 530 to generate an n th product 532. Normalization or other 
conventional processing can be performed on the factors, the factor weights, 
the sums and/or the products if desired. The products 524, 528 and 532 are 
added at 536 to generate a removal factor 540 for the cache item. The cache 
manager 166 repeats the steps for other cache items. Subsequently the 
O 10 cache manager 166 updates the ordering of the table 400 to reflect the 

^ changes to the removal factors. A lowest-to-highest or highest-to-lowest 

ifj ordering can be used depending on how the table 400 is accessed. 

5| Skilled artisans can appreciate that still other methods of calculating 

P the removal factors are contemplated. The cache manager 166 can calculate 

jj: 15 the removal factors based on a function F that depends on one or more of the 

usage and/or data type factors for each cache item. 

As can be appreciated from the foregoing, the access times that are 
required to retrieve a requested web page in a set top box environment is 
significantly reduced by the cache manager according to the present 
20 invention. Furthermore, management of the cache is optimized through the 
use of an optimized heuristic that takes into account usage, developer- 
designated priority, data type, compression status, size and/or other pertinent 
factors. 

Those skilled in the art can now appreciate from the foregoing 
25 description that the broad teachings of the present invention can be 



implemented in a variety of forms. Therefore, while this invention has been 
described in connection with particular examples, thereof, the true scope of 
the invention should not be so limited since other modifications will become 
apparent to the skilled practitioner upon a study of the drawings, specification 
and the following claims. 
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What is claimed is: 

1 . A cache management system for a set top box that improves the 
loading speed of hypertext markup language (HTML) documents that are 
provided by web servers through a distributed communications system, 
comprising: 

5 a set top box including a processor and memory, wherein said 

memory includes cache; 

a plurality of HTML documents that are stored in said cache; 

and 

a cache manager that manages said cache and that calculates a 
1 0 removal factor for each of said HTML documents, 

wherein said cache manager removes at least one of said HTML 
documents based on its removal factor until sufficient room is available for an 
additional HTML document when a user requests access to said additional 
HTML document and insufficient cache is available to store said additional 
15 HTML document in said cache. 

2. The cache management system of claim 1 wherein said removal 
factor is calculated using at least one of a logical and an arithmetic function. 

3. The cache management system of claim 1 wherein said removal 
factor is at least partially based on a number of times that said at least one of 
said HTML documents was used while it was stored in cache. 



17 



4. The cache management system of claim 1 wherein said removal 
factor is at least partially based on how long since said at least one of said 
HTML documents was last used. 

5. The cache management system of claim 1 wherein said removal 
factor is at least partially based on a size of said at least one of said HTML 
documents. 

6. The cache management system of claim 1 wherein said removal 
factor is at least partially based on a file type of said at least one of said HTML 
documents. 

7. The cache management system of claim 1 wherein said removal 
factor is at least partially based on whether said at least one of said HTML 
documents is compressed or decompressed. 

8. The cache management system of claim 1 wherein said removal 
factor is at least partially based on a content developer-designated priority for 
said at least one of said HTML documents. 

9. The cache management system of claim 1 wherein said removal 
factor is calculated by multiplying a first factor by a first weight to generated a 
first product, a second factor by a second weight to generate a second 
product, and an n th factor by an n th weight to generate an n th product. 
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10. The cache management system of claim 9 wherein said first, 
second and nth products are summed. 

11. The cache management system of claim 10 wherein said first, 
second and n th products are normalized prior to being summed. 

12. The cache management system of claim 10 wherein said first 
factor is normalized prior to being multiplied by said first weight. 

13. The cache management system of claim 1 wherein said cache 
manager employs a lookup table when calculating said removal factors. 

14. The cache management system of claim 1 further comprising: 

a table that is accessible by said cache manager, that is stored 
in memory of said set top box and that arranges said HTML documents in an 
ordered list based on said removal factors. 

15. The cache management system of claim 1 wherein said cache 
manager employs hysteresis when removing said HTML documents from said 
cache to create room for said additional HTML document when said user 
requests access to said additional HTML document and insufficient cache is 
available to store said additional HTML document. 
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16. The cache management system of claim 15 wherein said cache 
manager provides hysteresis by continuing to remove said HTML documents 
until a predetermined extra amount of said cache is available. 

17. The cache management system of claim 1 wherein said cache 
manager continues to remove said HTML documents until available system 
memory is above a threshold. 

18. The cache management system of claim 1 wherein said cache 
manager removes preloaded HTML documents that are related to said at 
least one of said HTML documents that is removed based on its removal 
factor. 
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19. A method for managing cache in a set top box to improve the 
loading speed of hypertext markup language (HTML) documents that are 
provided by web servers through a distributed communications system, 
comprising the steps of: 

storing a plurality of HTML documents in said cache; 
calculating a removal factor for each of said HTML documents, 

and 

removing at least one of said HTML documents based on its 
removal factor until sufficient room is available for an additional HTML 
document when a user requests access to said additional HTML document 
and insufficient cache is available to store said additional HTML document. 

20. The method of claim 19 wherein said removal factor is 
calculated using at least one of an arithmetic function and a logical function. 

21. The method of claim 19 wherein said removal factor is at least 
partially based on a number of times that said at least one of said HTML 
documents was used. 

22. The method of claim 19 wherein said removal factor is at least 
partially based on how long since said at least one of said HTML documents 
was last used. 

23. The method of claim 19 wherein said removal factor is at least 
partially based on a size of said at least one of said HTML documents. 



21 



24. The method of claim 19 wherein said removal factor is at least 
partially based on a file type of said at least one of said HTML documents. 

25. The method of claim 19 wherein said removal factor is at least 
partially based on whether said at least one of said HTML documents is 
compressed or decompressed. 

26. The method of claim 19 wherein said removal factor is at least 
partially based on a content developer-designated priority for said at least one 
of said HTML documents. 

27. The method of claim 19 wherein said step of calculating said 
removal factors includes the steps of: 

multiplying a first factor by a first weight to generated a first 

product; 

multiplying a second factor by a second weight to generate a 
second product; and 

multiplying an n th factor by an n th weight to generate an n th 

product. 

28. The method of claim 27 wherein said step of calculating said 
removal factors includes the step of: 

summing said first, second and n th products. 
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29. The method of claim 28 wherein said step of calculating said 
removal factors includes the step of: 

normalizing said first, second and third products prior to said 
summing step. 

30. The method of claim 27 wherein said first factor is normalized 
prior to being multiplied by said first weight. 

31 . The method of claim 19 wherein said step of calculating includes 
the step of: 

using a lookup table. 

32. The method of claim 18 further comprising the step of: 
arranging said HTML documents in an ordered list based on 

said removal factors. 

33. The method of claim 19 further comprising the step of: 
employing hysteresis when removing said HTML documents 

from said cache to create room for said additional HTML document when said 
user requests access to said additional HTML document and insufficient 
cache is available to store said additional HTML document. 

34. The method of claim 33 further comprising the step of: 
providing hysteresis by continuing to remove said HTML 

documents until a predetermined extra amount of said cache is available. 
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35. The method of claim 19 further comprising the step of: 
removing said HTML documents until available system memory 

is above a threshold. 

36. The method of claim 19 further comprising the step of: 
removing preloaded HTML documents that are related to said at 

least one of said HTML documents that is removed based on its removal 
factor. 
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HYPERTEXT MARKUP LANGUAGE 
CACHE SYSTEM AND METHOD 

ABSTRACT 

A cache management system for a set top box that improves the 
loading speed of hypertext markup language (HTML) documents that are 
provided by web servers through a distributed communications system. The 
cache management system includes a set top box with a processor and 
memory that includes cache. A plurality of HTML documents stored in the 
cache. A cache manager manages the cache and calculates a removal factor 
for each of the HTML documents. The cache manager removes at least one 
of the HTML documents based on its removal factor until sufficient room is 
available for an additional HTML document when a user requests access to 
the additional HTML document and insufficient cache is available to store the 
additional HTML document. A table is accessible by the cache manager and 
is stored in memory of the set top box. The HTML documents are arranged in 
an ordered list in the table based on the removal factors. The cache manager 
employs hysteresis when removing the HTML documents from the cache to 
create room for the additional HTML document when the user requests 
access to the additional HTML document and insufficient cache is available to 
store the additional HTML document. The cache manager provides 
hysteresis by continuing to remove the HTML documents until a 
predetermined extra amount of the cache is available. The cache manager 
also continues to remove the HTML documents until available system 
memory is above a breathing room threshold. The cache manager keeps the 
maximum number of relevant web pages in cache to maximize loading speed. 
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Attorney Docket No. 8336A-000020 



DECLARATION AND POWER OF ATTORNEY 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an 
original, first and joint inventor (if plural names are listed below) of the subject matter which is 
claimed and for which a patent is sought on the invention entitled 



HYPERTEXT MARKUP LANGUAGE CACHE SYSTEM AND METHOD 
the specification of which (check one) 

[X] is attached hereto. 




I ] 



was filed on 
Serial No. 



and 



was 
_ (if applicable) 



as Application 
amended on 



I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information that is material to the patentability of the 
invention claimed in this application, or information that is material to the examination of this 
application, in accordance with Title 37, Code of Federal Regulations, section 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, section 1 19(a)-(d) of 
any foreign application(s) for patent or inventor's certificate listed below and have also 
identified below any foreign application for patent or inventor's certificate having a filing date 
before that of the application on which priority is claimed: 



PRIOR FOREIGN APPLICATION(S) 

Priority Claim 



(Number) 


(Country) 


(Day/Month/Year filed) 


Yes 


No 


(Number) 


(Country) 


(Day/Month/Year filed) 


Yes 


No 



(Number) (Country) (Day/Month/Year filed) Yes 
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DECLARATION AND POWER OF ATTORNEY 



I hereby claim the benefit under Title 35, United States Code, §1 19(e) of any United States 
Provisional application (s) listed below: 

PRIOR PROVISIONAL APPLICATIONS 



(application serial number) (Month / Day / Year filed) 



(application serial number) (Month / Day / Year filed) 



I hereby claim the benefit under Title 35, United States Code, section 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States application in the manner provided by the 
first paragraph of Title 35, United States Code, section 1 12, 1 acknowledge the duty to disclose 
material information as defined in Title 37, Code of Federal Regulations, section 1.56 which 
became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 

Application Serial No. Filing Date Status - patented, 

pending, abandoned 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United 
States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 

I hereby appoint Gregory A. Stobbs, Reg. No. 28764, Michael W. Wiggins, Reg. No. 34754, 
and each principal, attorney of counsel, associate and employee of Harness, Dickey & Pierce, 
P.L.C., who is a registered Patent Attorney, my attorney with full power of substitution and 
revocation, to prosecute this application and to transact all business in the Patent and 
Trademark Office connected therewith. I request the Patent and Trademark Office to direct all 
correspondence and telephone calls relative to this application to Harness, Dickey & Pierce, 
P.L.C., P. O. Box 828, Bloomfield Hills, Michigan 48303 (248) 641-1600. 



Full name of sole or first inventor: Victor Tso 



Inventor's signature: 

Date: 

Residence: 788 Hollv Oak Dr.. Palo Alto California 94303 

Citizenship: United States of America 

Post Office Address: Same 
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Full name of second joint inventor: Brian Knittel 

Second Inventor's signature: 

Date: 

Residence: 859 Northampton Drive, Palo Alto, California 94303 

Citizenship: United States of America 

Post Office Address: Same 
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